Discovering CNVs from read depth analysis of next generation sequencing data

نویسندگان

  • Alexej Abyzov
  • Alexander Eckehart Urban
  • Michael Snyder
  • Mark Gerstein
چکیده

Genomic Structural Variations (SV), including Copy Number Variations (CNV), are believed to contribute significantly to variations between human individuals, and may have larger effect on phenotype than SNPs [1, 2]. Therefore, the importance of their discovery has been realized. While originally detected from analysis of aCGH array data, SVs/CNVs can now be more efficiently discovered from next generation sequencing data such as Solexa/Illumina, SOLiD, Helicos etc. Here we present a novel method to detect deletions and duplications from statistical analysis of mapping density (read depth) of short reads from next generation sequencing platforms. We have adopted a mean-shift technique originally developed in computer science for image processing [3, 4] to analysis of read depth (RD) data. This approach performs the discontinuity preserving smoothing of RD signal through kernel density estimation and the mean-shift computation and results in segmentation of RD signal across genome. Then, CNVs regions are predicted from the analysis of calculated segments.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An enhanced method for targeted next generation sequencing

Copy number variants (CNV) are a major cause of disease, with over 30,000 reported in the DECIPHER database. To use read depth data from targeted Next Generation Sequencing (NGS) panels to identify CNVs with the highest degree of sensitivity, it is necessary to account for biases inherent in the data. GC content and ambiguous mapping due to repetitive sequence elements and pseudogenes are the p...

متن کامل

GROM-RD: resolving genomic biases to improve read depth detection of copy number variants

Amplifications or deletions of genome segments, known as copy number variants (CNVs), have been associated with many diseases. Read depth analysis of next-generation sequencing (NGS) is an essential method of detecting CNVs. However, genome read coverage is frequently distorted by various biases of NGS platforms, which reduce predictive capabilities of existing approaches. Additionally, the use...

متن کامل

cn.MOPS: mixture of Poissons for discovering copy number variations in next-generation sequencing data with a low false discovery rate

Quantitative analyses of next-generation sequencing (NGS) data, such as the detection of copy number variations (CNVs), remain challenging. Current methods detect CNVs as changes in the depth of coverage along chromosomes. Technological or genomic variations in the depth of coverage thus lead to a high false discovery rate (FDR), even upon correction for GC content. In the context of associatio...

متن کامل

CNV-CH: A Convex Hull Based Segmentation Approach to Detect Copy Number Variations (CNV) Using Next-Generation Sequencing Data

Copy number variation (CNV) is a form of structural alteration in the mammalian DNA sequence, which are associated with many complex neurological diseases as well as cancer. The development of next generation sequencing (NGS) technology provides us a new dimension towards detection of genomic locations with copy number variations. Here we develop an algorithm for detecting CNVs, which is based ...

متن کامل

Sensitive and accurate detection of copy number variants using read depth of coverage.

Methods for the direct detection of copy number variation (CNV) genome-wide have become effective instruments for identifying genetic risk factors for disease. The application of next-generation sequencing platforms to genetic studies promises to improve sensitivity to detect CNVs as well as inversions, indels, and SNPs. New computational approaches are needed to systematically detect these var...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009